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Abstract 



In this paper we present an algorithm that generates A;-noncrossing, cr-modular di- 
agrams with uniform probability. A diagram is a labeled graph of degree < 1 over 
n vertices drawn in a horizontal line with arcs in the upper half-plane. A 

fc-crossing in a diagram is a set of k distinct arcs (?2, J2), • • ■ , (ifc, jk) with 

the property i\ < %i < . . . < i& < j\ < ji < . . . < jk- A diagram without any 
/c-crossings is called a /c-noncrossing diagram and a stack of length a is a maximal 
sequence (i + 1, j — 1), • • • , (i+ (cr— 1), j — (a — 1))). A diagram is cr-modular 

if any arc is contained in a stack of length at least a. Our algorithm generates af- 
ter 0(n h ) preprocessing time, fc-noncrossing, cr-modular diagrams in 0(n) time and 
space complexity. 



Keywords: fc-noncrossing diagram, uniform generation, RSK-algorithm 
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1. Introduction 

A ribonucleic acid (RNA) molecule is the helical configuration of a primary structure 
of nucleotides, A, G, U and C, together with Watson-Crick (A-U, G-C) and (U- 
G) base pairs (arcs). It is well-known that RNA structures exhibit cross-serial 
nucleotide interactions, called pseudoknots. First recognized in the turnip yellow 
mosaic virus in [7], they are now known to be widely conserved in functional RNA 
molecules. 

Modular /c-noncrossing diagrams represent a model of RNA pseudoknot structures 
[HE], that is RNA structures exhibiting cross-serial base pairings. The particular 
case of modular noncrossing diagrams, i.e. RNA secondary structures have been 
extensively studied P El El E] • 

A diagram is a labeled graph over the vertex set [n] = {1, . . . ,n} with vertex de- 
grees not greater than one. The standard representation of a diagram is derived by 
drawing its vertices in a horizontal line and its arcs in the upper half-plane. A 
/c-crossing is a set of k distinct arcs (^2, J2), • • • , (4, jk) with the property 

ii < i 2 < ■ ■ ■ < ik < ji < 32 < ■ ■ ■ < jk- 

A diagram without any /c-crossings is called a /c-noncrossing diagram. Furthermore, 
a stack of length a is a maximal sequence of "parallel" arcs, 

(i + 1, j - 1), . . . , {1 + (a - 1), j -{a- 1))) 

and is also referred to as a a-stack. A /c-noncrossing diagram having only stacks of 
lengths one is called a core. 

Biophysical structures do not exhibit any isolated bonds. That is, any arc in their 
diagram representation is contained in a stack of length at least two. We call a 
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123456789 1011 123456789 1011 

FIGURE 1. &>noncrossing diagrams: a 4-noncrossing diagram (left) and 
a 2-noncrossing diagram (right). The arcs (2, 6), (4, 8) and (5,11) form a 
3-crossing in the left diagram. 

diagram, whose arcs are contained in stacks of lengths at least cr, cr-modular. Mod- 
ular, fc-noncrossing diagrams are likely candidates for natural molecular structures. 
Sequence lengths of interest for such structures range from 75-300 nucleotides. 

The main result of this paper is an algorithm that generates fc-noncrossing, cr- 
modular diagrams with uniform probability. Our construction is motivated by the 
ideas of [2], where a combinatorial algorithm has been presented that uniformly gen- 
erates /c-noncrossing diagrams in 0(n k ) time complexity. To be precise, we generate 
fc-noncrossing modular diagrams "locally" having a success rate that depends on 
specific parameters, see Fig. |2j 

The paper is organized in two sections. In Section [2] we lay the foundations for our 
main result by generating core diagrams with uniform probability. In Section [3] we 
introduce weighted cores and subsequently prove the main theorem. 



2. Core diagrams 

A shape A is a set of squares arranged in left-justified rows with weakly decreasing 
number of boxes in each row. A Young tableau is a filling in squares in the shape 
with numbers, which is weakly increasing in each row and strictly increasing in each 
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FIGURE 2. Uniformity and success-rate of Algorithm 2. We run Algo- 
rithm 2 for 5 x 10 6 times attempting to generate 3-noncrossing 2-modular 
diagrams over 20 vertices. 4, 354, 410 of these executions generate a mod- 
ular diagram. In (a) we display the frequency distribution of multiplicities 
(dots) and the Binomial distribution (curve). In (b) we display the success 
rate of Algorithm 2 as a function of n for the following classes of modular 
diagrams: k = 3, a = 2 ([!]), k = 4, a = 2 ([2]) and k = 5, a = 2 ([3]). 



column. A ^-tableau of A n is a sequence of shapes, 



such that A* is differ from A* 1 by at most one square. See Fig. [3] (a). 

According to [1] we have a bijection between /c-noncrossing diagrams and a *- 
tableaux of having at most (k — 1) rows. Let us make the bijection explicit: 
reading the ^-tableaux having n steps from left to right we do the following: if 
A* \ A'" 1 = +□, we insert % in the new square. Otherwise if A* \ X l ~ l = — □, we 
extract the unique entry j via inverse RSK algorithm [H |2] and form an arc (j, i). 
By inverse RSK algorithm we mean the following: given a Young tableau Y % of 
shape A* and a shape A* +1 such that A 4+1 \ X 1 = — □, there exists a unique entry j of 



= A , A 1 ,... 



A", 
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Y % and a Young tableau Y l+1 of shape A l+1 such that RSK-insertion of j into Y l+1 
recovers Y % . Finally, in case of A 1 \ A*" 1 = we do nothing, see Fig. [3j Given a 
/c-noncrossing diagram, we read the vertices from right to left and initialize X n = 0. 
If % is a terminal of an arc, we obtain A*" 1 by inserting j into A* via RSK 

insertion. If i is an isolated vertex we do nothing, and remove the square contain i 
when it is an origin of an arc, see Fig. [3J 




( e ) 0-^n-^g-^g-^p-^p-^m-^n-^g-^n-^0 

FIGURE 3. From *-tableaux to diagrams and back. Reading (a) from left 
to right, we insert i into the new square in case of A* \ A 4 " 1 being a — □- 
step and extract the square via inverse RSK if A* \ A*" 1 is a — D-step. The 
extraction leads to an arc. Reading (c) from right to left, A i_1 is obtained 
by RSK insertion of j into A* if i is the terminal of an arc. We do nothing 
if i is an isolated vertex and we remove the square with entry i in case of 
i being an origin of an arc. 

Let 

Ti(X) = {(A-OcKjXi | (A J )j is a ^-tableau having at most {k — 1) rows and A* = A}. 
Any d G Tj(A) induces a unique arc-set A{d). We set Aq(i9) = and do the following 
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• for a +E-step, we insert h into the new square, 

• for a 0-step, we do nothing, 

• for a — El-step, we extract the unique entry, j(h), of the tableaux Y 11 ' 1 which, 
if RSK-inserted into Y h , recovers Y h ~ l and set A h (d) = A /l „ 1 (^)U{(j(/i), h)}. 

Setting A($) = A^) we obtain an induced arc set A($), as well as a unique sequence 
of Young tableaux Y($) = {Y° = 0, Y 1 , . . . , Y 1 }, where for h < i, Y h is a Young 
tableau of shape X h . These extractions generate a set of arcs i), which in turn 
uniquely determines a fc-noncrossing diagram. 

Lemma 1. Suppose r > 1 and G ^i(A) is a *-tableaux such that 

(p, q), (p + 1, q - 1), . . . , (p + r, q - r) 

are stacked pairs of insertion- extraction steps. Let /(^ P) q, r ) € ^(A) be the ^-tableaux 
in which all r insertion- extraction pairs (jp + 1, q — 1), . . . , (p + r, q — r) are replaced 
by 2r 0-steps. Then we have a correspondence between $ p , g , r and fi'&p^r)- 

Proof. Let Y($ p q r ) denote its associated sequence of Young tableaux, 

(2.1) (Y') o < t < ? , = (Y° = 0,Y 1 ,...,Y i )- 
We next construct a new sequence of Young tableaux, 

(2.2) Y(f(0 M )) = {J°,J 1 ,...,J n = Y i }, 
from right to left via the following algorithm 

• for a — El-step of the original ^-tableaux, let j be the unique entry 
extracted from Y 1 ^ 1 which if RSK-inserted into Y l recovers Y*" 1 . If t = 
q, q — 1, . . . , q—r we do nothing, otherwise: J l ~ l is obtained by RSK-insertion 
of j into J', 

• for a 0-step, we do nothing, 



7 



• for a +D-step, if t = p+1, ■ ■ ■ , p+r, we do nothing, otherwise J 1 1 is obtained 
by removing the square with entry t from J*. 

By construction, J° = and considering the induced sequence of shapes of the 
sequence of Young tableaux J°, . . . , J 1 we obtain a unique ^-tableau f{$p,q, r )- By 
construction f{~&p iq> r) has 0-steps at step p + 1, . . . , p + r and steps q — 1, . . . , q — r, 
respectively 

Suppose we are given a ^-tableaux ip Pt q t r having the insertion-extraction pair (p, q) 
and 0-steps at step p + 1, . . . ,p + r and q — 1, . . . , q — r, respectively together with 
its sequence of Young tableaux (J')o<t<i- Then we construct the sequence of Young 
tableaux (Y*)o<t<i initialized Y° = J° = 0: 

• for a — CD-step of the original ^-tableaux, ip p ,q, r , let j be the unique entry 
extracted from Y l ~ l which if RSK-inserted into Y l recovers Y t_1 . Y l ~ l is 
obtained by RSK-insertion of j into Y t , 

• for a 0-step of ip p , q , r , if ^ = q — 1, . . . ,q — r, we add a square and insert 
p + 1, . . . ,p + r. If £ = p + 1, . . . ,p + r, we remove the square with the 
respective entry p + 1, . . . , p + r. Otherwise, we do nothing. 

• for a +d-step of ip p , q , r , Y l ~ x is obtained by removing the square with entry 
t. 

It is straightforward to verify that the above algorithm is welldefined and recovers 
the ^-tableaux from f($ Pt q t r), whence the lemma. See Fig. HJ □ 

We next consider 

(2.3) 77(A) = {te IK A) | Va G A{t), a is an isolated arc} 

and set £f(A) = |T!f(A)|. Given a shape A\ let A*^ 1 denote the shape from which 
A* is obtained by adding a square in the jth row, and A* j 1 denote the shape from 
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FIGURE 4. (a) a *-tableaux i? 1)8)2 in which (1,8), (2, 7) and (3,6) are 
stacked pairs of insertion-extraction steps, (b) 8,2) is the unique *- 
tableau derived from "#1,8,2 in which the 4 steps: step 2, 3, 6 and 7 are 
0-steps. 



which A* is derived by removing a square in the jth row. Thus tracing back a shape 
A* we observe that it is either derived by 

• A*^ 1 (obtained by adding a square in the jth row), 

• Aq" 1 (doing nothing), or 

• A*! 1 (obtained by removing a square in the jth row). 
Lemma 2. 

fc-i fc-i L^irJ 

(2.4) = ^(A- 1 ) + ^^(A}; 1 ) + E (-w-i-^ca}: 1 -*). 



Proof. By construction, +D-steps as well as 0-steps do not induce new arcs. An arc 
a is only formed when removing a square and such an arc is potentially stacking. 
Let 

Gi-xiX)-}) = {{\ h ) Q < h <i-i e Tl^X)-}) I A* \ A^ 1 = -Dj and a is stacking}. 







Thus, for any t E T i c _ 1 (A*_ 1 ) \ Gi_i(A*_ ), the *-tableaux (i, — Qf) is contained in 
T[(X l ). We accordingly arrive at 

(2.5) Z?(A) = TUK- 1 )^ (\J^m) U \p[TU*T-) \ 
which implies 

fc-l fc-1 

(2.6) tf(A<) = tUiX^ 1 ) + ^ttiCA}; 1 ) + £ K-iCA}! 1 ) - ^(A}! 1 )] • 

We next provide an interpretation of Gj_i(A*C )■ Suppose the entry extracted at 
step i is The fact that a is in a stack implies that the (i — l)th step is also 
a — □ step and that the extracted entry is + 1. For $ E Gj_i(A*C 1 ), we apply 
Lemma [T] and replace the insertion of step + 1 and the extraction at step (z — 1) 
by respective 0-steps, and thereby obtain the ^-tableaux /($). We then remove the 
two 0-steps and obtain the unique ^-tableaux 

# E TU^ 3 ), 

where A* can be derived from A*~ 3 by removing a square in the jth row. We next 
claim i?' E T l c _ 3 (X i j Z 3 )\G i ^ 3 (X i j Z 3 )- Suppose E G i „ 3 (A}~ 3 ), then ft contains a stack 
of length three, implying $ ^ Gj_i(A* l 1 ), which is impossible. Therefore, we have 
the bijection 

(2-7) (3: G^Xp) — ► TU^f-) \ Gi-siXp), 

from which we conclude 

^i(A}I 1 )=tt 3 (A}I 3 )-^3(A^). 

Replacing the term g r {X r ^J) and using the fact that for any shape fi, gi(fx) = go(fi) = 
holds, we arrive at 

W ,(A} ; ) = E(" 1 ) P " 1 ^ 2P -i( A 5- 2P " 1 )- 
P =i 
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This allows us to rewrite eq. (12. 6p as 

fc-l k-l L^J 

t-w = n-^ 1 ) + E^-xW; 1 ) + E E c-i^i-^caJ: 1 "*) 

j=l j=l p=0 

and the proof of the lemma is complete. □ 

Lemma [2] allows us to compute the terms t?(A) for arbitrary i and A recursively via 
the terms t c h (\'), where h < i and the shapes A' differ from A by at most one square. 

We next generate a *-tableaux d G Z^(A n = 0) from right to left. For this purpose 
we set fj, 1 = \ n ~ l for all < % < n and initialize fi° = 0. Suppose we have at step i 
the shape /i* and consider the T^_ i (A n ~*)-paths starting from A = and ending at 
A n "* = fi*. 

Corollary 1. The transition probabilities 
(2.8) 



i+1 _ „i+l I vi — ,A — ) J ' 



P(X 4+1 = I X* = n 



y-iL(n-i-l)/2J /"_iYp f c di'+M ■ • , 

^p=o l t„_i_ 2 p-i<,M ; t v ,+i _ _|— | 



where 1 < j < fc — 1, induce a locally uniform Markov-process (X l )i whose sampling 
paths are shape-sequences (//);. 



Let Rand(/i l ) denote the random process of locally uniformly choosing X l+1 = fi l+1 
for given X 1 = fi l using the transition probabilities given in eq. (12. 8p . Corollary [T] 
gives rise to the following algorithm: 



The key observation now is that any core-diagram generated via the above Markov 
process has uniform probability. 

Theorem 1. Any core-diagram generated via the Markov-process (X l )i (by means 
of the algorithm Rand(/i l ) ) is generated with uniform probability. 
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Algorithm 1 Core(n, k) 
l: m +- 

2: while m < n do 

3: ^ m+1 <- Rand(/i m ) 

4: if fi m+1 \ n m = +□ then 

5: insert (m + 1) in the new square 

6: else if fi m+1 \ fi m = -□ then 

7: let pop be the unique extracted entry of T m which if RSK-inserted into 

rpm+l recovers J 1 ™ 

8: create an arc (pop, m + 1) 

9: if (pop,m + 1) is stacking with lastpair then 

10: restart the process Core(n, k) 

11: else 

12: put (pop, m + 1) in the arc set A 

13: lastpair <— (pop, m + 1) 

14: end if 

15: end if 

16: m <r- m + 1 

17: end while 



Proof. Suppose we are given a sequence of shapes 

/ A//-\...,/i o = 

Let U n -i(fi l ) denote the subset of ^-tableaux 

= A , A 1 , . . . , A n_i = fi l 

such that there is no stack in the induced arc set of 

(AV . . , A"-*-\ A™" 4 = //, //"\ . . . , /i° = 0). 

In particular, U n (0) denotes the set of all *-tableaux of shape having at most 
(k — 1) rows that generate only core- diagrams. Let u n (0) = \U n (0)\ denote the 
number of cores of length n. By construction, we have 

f4_»(//) c TUtf), 
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We now condition the process (P)j, whose transition probabilities are given by 
eq. (12.81) . on generating cores. That is, we consider only those ^-tableaux generated 
by (X l )i that are contained in U n (0). Let this process be denoted by (Z l )i. We 
observe 

T^_ i _ 1 (f/ +1 ) n u M (n i+1 ) = u n .^ +1 ). 

Accordingly, using eq. ( 12.81) . we derive for the transition probabilities 



F(Z i+l 

Therefore we arrive at 



P(Z m ) = ] 

p=0 

and in particular 



\U n -i{p)\ 

f/ n _ J _ 1+p (//+ 1 - p )| \U rM (^ +1 )\ \U lM (^ +1 )\ 



\U n - l+p (^-v)\ \U n (fi° = 0)\ u n (0) 
p(z » = 0) _IW=0)|_ 1 



\u n (fi° = 0)\ u n (0y 

which implies that the process [Z % \ generates cores with uniform probability. □ 



3. Modular diagrams 



Any cr-modular diagram can be mapped into a cr-weighted core, i.e. a diagram 
whose arcs have additional weights > a. Suppose we have a *-tableaux of 0, i?, 
whose induced diagram is a a-modular diagram. Repeated application of Lemma [T] 
for each respective stack 

S = ((p,q),(p+l,q - 1),. . . ,(p+ (s -l),q - (s - 1))) , 
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allows us to replace any insertion- step p+ 1, . . . , p + (s — 1) as well as any extraction- 
step q — (s — 1), . . . , q — 1 by 0-steps, respectively. Removing the 2(s — 1) 0-steps 
and assigning the stack-lengths s to the extraction in step q, generates a ^-tableaux 
of with weights, 9 (a- weighted ^-tableaux) . 

Using the correspondence between ^-tableaux and diagrams, a u-weighted core can 
therefore be represented as a sequence of shapes, 9 in which, preceding each extrac- 
tion step, we have the additional insertion of exactly 2(s — 1) 0-steps, see Fig. [5j 
Let Wf(X r ) denote the set of a- weighted ^-tableaux. Each such 9 G W°~(\ r ) induces 



+ 1 +2 +3 + 4 +5 -2 



*0 




012345 6 7 8 9 10 11 i 2 3 4 5 6 7 s 91011 

) -l_0_+3_0_0_0_0_-l -3 

"i — ► a — ► in — *-0 



(b) 0— *-[!]— *-E] I ** J 




345 6 7 8 9 10 11 1234567391011 



(c) +m +B -m, _ -el 

v ; ► a ► gj — ► gj ► m ► 



12 34 5 



FIGURE 5. (a) a *-tableaux whose induced diagram is a 2- modular dia- 
gram, (b) the *-tableaux obtained by repeated application of Lemma [TJ 
The red and blue removed arcs correspond the red and blue 0-steps in the 
♦-tableaux, respectively, (c) the weighted *-tableaux induced by (b) with 
weights 2 and 4 assigned to the two extraction steps, respectively, and its 
induced weighted core. 



a unique *-tableaux, p(9), contained in T r c (A r ) and we have 

h<r/2 

i=\ 

where sg is the weight of the ith. extraction in 9. We set u>f(A r ) = IH^A 7 ")!. 
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Lemma 3. We have the recursion formula 

k-l 

(3.1) Wi (Y) = wUiXl-^ + ^K-iWl 1 ) 

3=1 

k-i L^J Lf J 

+ EE E(- 1 )^ 1 ^^^X- 2s+ i(^: 1 ), 

j=l s=a 1=1 

where p(a, £, a) denotes the number of partitions of a into i blocks, {a 1; a 2 , . . . , a^} ; 
such that Vz < I, > a. 



Proof. Any ^-tableaux 9 G W°(\ r ), where % = r + Y^t=i — 1); s z is the weight 
assigned to the £th extraction step in 9. We consider the weighted ^-tableaux, 9', 
derived from 9 by removing the shape in step r. If A r is derived from A r_1 by doing 
nothing, then 9' G W^I^Aq -1 ). Similarly, if A r is derived from X 7 '^ 1 by adding a 
square in the jth row, we have 9' G W / i °L 1 (A^ 1 ). In case of A r being derived from 
A r_1 via removing a square from the jth row, we are given an extraction step with 
associated weight s. Thus, 

° f e ^-i+el-^-i)^ 1 ) = ^-i)- 2(s -i)(A-: 1 )- 

& determines *-tableaux, p(9') G T^X'r 1 ) \ G^Yr 1 ). Let V^,^,^: 1 ) 
denote the set of weighted ^-tableaux 9\ such that p{6\) G G r _i(AjI 1 ). We set 
vU^s-D^T-) = \V(U)-2(s-iM-)l Note that then 9' G W^d^-i)^! 1 ) \ 
V(ti)-2( s -i)( xr j-)^ whence 

(fc-i 
U^-iC^; 1 ) 
3=1 

(k-i L^J 

u UU [^-i)-2 (s -i)(^: 1 )\^i)- 2(s -i)(^: 1 )] 

yj = l s=a 
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We therefore derive 
(3.2) 

fc-i fc-i L^J 

<(A r ) = wUiK- 1 ) + E<-i( A i^) + EE K^+ilA-: 1 ) - vi^iY- 1 )}. 

j=l j = l s=cr 

We proceed by considering a *-tableaux ( £ V'^_ 1 - ) _ 2 ^_ 1 - ) (AjI 1 ). By construction, 
(r — 1) is a — D-step. Suppose the induced arc of this extraction is a and the weight 
assigned to it is given by s'. Then £ G>_i(A!p ) and we have the bijection 

0-. a-i^: 1 ) — ► r r c _ 3 (A- 3 ) \ g p _ 3 (a;: 3 ), 

obtained by removing the insertion and extraction step of the extracted square in 
step (r — 1). Taking into the account weights, j3 gives rise to the bijection 

i— 2s+l I 
L 2 J 

(3': V ( °"„ 1) _ 2(s „ 1) (Aj._ 1 ) — ► [J [W / '^„ 1) _ 2(s+s ,_ 1) (A^_ 3 ) \ V ( °'_ 1) „ 2(s+S ,_ 1) (A^_ 3 )], 

S '=C7 

from which we conclude 

V (i-l)-2(s-l) (^j- 1 ) = E t U '{i-l)-2(s+s'-l)(Ai- 3 ) _ t; r«-l)-2(s+s'-l)(^i- 3 )]- 

s'=(T 

Using (a) 

. . . a: SlH — hS£ = p(s, £, a )x s 

Sl=(T S£ = (T 

where p(s, £, a) denotes the number of partitions of s into £ blocks of size > a, and 
(b) that for any shape /x, = v^(fi) = 0. We iterate the above formula by 
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replacing the terms u£(A£_) 

s=a 

= E <- 2s+ i(A -EE «- 2s - 2s ' + i(A ^: 3 ) - <- 2s - 2s >+i(^: 3 )) 

S=f7 S=f7 s'=<7 

2(ai+-+s i )<i+l 

= £■■■ E (-i)V- 2(sl+ .- + . )+ i(\ r = 2m ) 

S = CT £=1 

whence the lemma. □ 

Lemma [3] allows us to compute wf(fi) for arbitrary i,fj, inductively via the terms 
Wfr(\) and h < i. We next consider the generation of a *-tableaux, i?, which cor- 
responds to a (T-modular diagram. For this purpose we shall generate a weighted 
^-tableaux 9 £ W^(A m = 0). Taking the sum over all weights we have m = 
n — ^2 h 2{sh — 1)- We construct 9 inductively from right to left setting //' = X m ~ r . 
We initialize /i° = and assign in case of yU r \ = — □ a weight to step r. 
Suppose we have arrived at // ', with the corresponding set of weights, S"\ Consider- 
ing sequences of weighted ^-tableaux contained in W°_ t _y i gr 2 ( Sf -i)(/ ir )5 Lemma [3] 
implies 
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Corollary 2. Let 

t = n-r- 2 (^- !) 

LfJ 

1=1 

The transition probabilities 

F{X r+1 = S r+1 ) | X r = (//, S r )) 



(3.3) 



"^-lO*' 4 " 1 ) r \ ,.r+\ _ cr+l _ or 

u 't r -i(/ jr+1 ) r \ ,,r+l _ r?< Qf+1 _ Or 

^?_l(M r+1 ,s) 



for 1 < j < k — 1, generate a locally uniform Markov-process (X l )i. 



Corollary[2]represents an algorithm for constructing cr-modular diagrams. In analogy 
to the case of core- diagrams, if X successfully constructs a modular diagram, it gen- 
erates the latter with uniform probability, see Fig. [2] (lefthand side). Consequently, 
the process {X l ) n generates random a- modular, fc-noncrossing diagram in 0(n) time 
and space complexity. According to the recursion of Lemma [3], we compute w°(\ l ) 
for arbitrary X 1 with at most (k — 1) rows and all % < n in 0(n) x 0(n k ~ r ) = 0(n k ) 
time and space complexity. 

Theorem 2. Any modular diagram derived via the Markov-process (X r ) r is gener- 
ated with uniform probability. 



Proof. Suppose we have a sequence of shapes 

//,// _ \ • • • = 
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Algorithm 2 Canonical(n, k, a) 

l: m <- 

2: while m < n do 

3: (/i m+1 , size) RandStep(/i m ) 

4: if /i m+1 \ /i m = +□ then 

5: insert (m + 1) in the new square 

6: assign size to the the new square 

7: m m + size — 1 

8: else if n m+1 \ n m = -□ then 

9: let pop be the unique extracted entry of T m which if RSK-inserted into 
T m+1 , recovers T m and let size be the integer assigned to the extracted 
square 

10: create a stack {{pop, m + size), ■ ■ ■ , (pop + size — 1, m + 1)} 
11: if (pop,m + size) is stacking with lastpair then 
12: restart the process Canonical(n, fc, a) 

13: else 

14: put {(pop, m + size), ■ ■ ■ , (pop + size — 1, m + 1)} in the arc set A 

15: lastpair (pop, m + size) 

16: m m + size — 1 

17: end if 

18: end if 

19: m <— m + 1 

20: end while 



with weights assigned to each // 1 \ fi l = — Q,-step and set of weights, S r . Let 

t = n -r- 2 ( s ^ ~ X ) 

and _D^ l _ T ,(/i r ) be the set of weighted ^-tableaux 

A = 0,A\...,A m - r = p r 

such that 

^< \0 \ m— r— 1 \ m— r r r—1 ^< 

= A,...,A ,A =p,/i ,...,/! = 



19 



is contained in W°(\ m = 0). Summing over all weights we have m = n — Yls, ^( s h~ 
1) and by construction, D% n _ r (fj, r ) C Wj 7 (fi r ). In particular 

equals the number of weighted core of length m, i.e. the number of modular dia- 
grams of length n. Suppose now we only consider sampling paths of weighed cores 
generated via {X r ) r (whose transition probabilities is given by eq. ( I3.3P ) contained 
in D a m (0>). We denote the resulting process by (Z r ) r . In view of 

we observe that 

(M^_ 1 (// r+1 )\^ 1 (^ +1 ))n^_ r _ 1 (^ +1 ) = D^^ 1 ) 



Therefore we have 



and consequently 



P(Z 



r+1 



\D a 

1 m—r- 




\D a 





r 



(3.4) F(Z r+1 ) = ] 

p=0 

In particular, 



|£>;u- 1+ > r+1 - p )| _ l^_ r _i(/i r+1 )l 



p(z m = 0i 



An-r +P (/^ P )l |D^° = 0)| 

= 0)| 1 1 



\D^° = 0)\ \D^(ji° = 0)\ <(0)' 
That is, the process (Z r ) generates modular diagrams uniformly and the theorem 
follows. □ 



In Fig. [61 we showcase two paths constructed via the process Canonical(n, k, a), 
for n = 8, k = 3 and a = 2. In Fig. [7] we construct the corresponding 2-modular 
diagram from the red path displayed in Fig. |6j 
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fail 




1/1 



Ms 

□ 2 
1/1 



^ 1/1 




FIGURE 6. Building weighted *-tableaux via the transition probabilities 
given in eq. (|3.3() . Here, the top path fails to generate a 2-modular diagram 
while the red path succeeds. According to Theorem [2] each such modular 
diagram is generated with uniform probability. 



(a) 
(b) 



-a -a +S +m 



r3i^[3i 



■a -a 



12 3 4 



12345678 



(c) 0^m^\m 



12345678 



FIGURE 7. (a) the red path of Fig.0 (b) the *-tableaux derived by adding 
four 0-steps in (a), (c) adding two pairs of insertion and extraction steps, 
which produces the 2-modular diagram. 
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